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Description 

BACKGROUND OF THE INVENTION 

[0001] The present invention relates to a processing 
unit and, in particular, to a central processing unit (CPU) 
such as a microcontroller. Microcontrollers are well 
known in the art, and a large variety of such devices from 
low end devices (e.g., a 4/8-bit controller) to high end 
devices (e.g., a 32-bit controller) are available. High end 
controllers are used in almost any kind of electronic de- 
vices, such as laser printers and hard disks or for con- 
trolling any kind of processes, for example, in automo- 
biles, etc. 

[0002] With more complex tasks to be solved, the de- 
mand for very fast, high end controllers is rising. In par- 
ticular, such devices are necessary to control real time 
applications. In such applications, it is necessary to react 
very fast to external events. If such an event occurs, the 
content of at least a specific set of registers needs to be 
switched. If this context switching is faster, the reaction 
time of the control system is better. 
[0003] EP-A-0 745 933 (IBM) 4 December 1996 
(1 996-1 2-04) discloses a multiple port register file for use 
with Very Long Instruction Word (VLIW) and N-way su- 
perscalar processors. Said register file includes a select- 
ed number of copies of a general purpose register build- 
ing block. The block includes two interleaved sub-banks 
of registers (odd and even registers) each comprising a 
number n of write ports and a number m of read ports. 
The sub-banks are interleaved by write ports and have 
non-interleaved read ports. 

SUMMARY OF THE INVENTION 

[0004] Therefore, it is an object of the present invention 
to provide a processing unit with the capability of very 
fast read/write access to address and data registers in a 
CPU. 

[0005] This object is achieved by means of a data 
processing unit as set out in claim 1 having a set of data 
registers and a set of address registers. Each register 
has a width of n bits. Furthermore, there are provided 
address load and store buffers associated with said ad- 
dress registers, data load and store buffers associated 
with said data registers and a bus having a plurality of 
bus lines being connected to said store buffers. A data 
memory unit is connected to said bus. The data registers 
are arranged in such a way that at least n data registers 
are connected in parallel to respective bus lines, n being 
greater than 1 and the address registers are arranged in 
such away, that at least m address registers are coupled 
in parallel to respective bus lines, m being greater than 
1 . Thus, at least four registers can be accessed in par- 
allel. 

[0006] If, for example, 4 data registers and 4 address 
registers can be loaded or stored simultaneously, then a 
32-bit microcontroller-bus has to be 256 bits wide. The 



advantage of such an arrangement is, that a context 
switch can be done very fast because 8 registers are 
stored and loaded in only 2 respective cycles. This allows 
a very fast reaction time to external events. Also, process- 
5 ing time is speed up because commands can be imple- 
mented that provide parallel processing. That is, more 
than one register can be loaded or stored at the same 
time. 

[0007] To enable multi-register read and write proce- 
10 dures, each register is provided with multiple uni- and/or 
multiple bidirectional ports, which couple the registers 
with a memory unit and/or with an address arithmetic 
pipe or an integer pipe or any other processing unit. The 
registers can be divided into different sets of registers, 
15 for example, even and odd registers. Thus, an optimized 
layout of the respective registers is possible, while ac- 
cessibility to the registers is maintained. Each of the 
above mentioned lines can be selected separately, and 
the lines are linked to odd or even memory addresses. 
[0008] A wide variety of implementations is possible. 
Either all register cells communicate through one multi- 
ple-word-wide bus, whereby each register cell has a plu- 
rality of read/write lines which are connected to each word 
lines of the multiple-word-wide bus, or different set of 
registers connect to respective word lines of the multiple- 
word-wide bus which connects to the memory. Multiplex- 
ers and alignment units can couple any register with any 
address location within the memory. 
[0009] Further, a unit is provided which handles zero 
and sign extension for byte and half-word load instruc- 
tions. The address registers can be connected to an ad- 
dress arithmetic unit or any other processing unit through 
an address forward buffer, and the data registers can be 
coupled to a multiply accumulator unit (MAC) and/or In- 
teger execution unit or any other processing unit through 
a respective data forward buffer. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0010] 

Figure 1 shows afunctional block diagram of a reg- 
ister file according to the present invention, 

Figure 2 A shows a data load and store buffer unit. 
Figure 2B shows an address load and store buffer 
unit. 

Figure 2C shows a data register file unit. 
Figure 2D shows an address register file unit, 
Figure 2E shows a data forward unit. 
Figure 2F shows an address forward unit, 
Figure 3 shows a address register cell in more detail, 
and 

Figure 4 shows a data register cell in more detail. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

[001 1 ] Figure 1 shows an overview of the general pur- 
pose register file of an 32 bit microcontroller according 
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to the present invention. Only one plane showing one 
single bit of each register is shown in this drawing. The 
general purpose register file 1 comprises an address reg- 
ister file 2 and a data register file 3 that contain a plurality 
of address and data registers 4, 7 with a predetermined 
bit width and buffers 5, 6, 8, 9. In the preferred embodi- 
ment the register bit width is 32 bits, and the number of 
registers is 16. This bit width can be any bit size according 
to the specification of the respective microcontroller, mi- 
croprocessor or data handling unit. 
[0012] On the one hand, the address registers 4 are 
connected via a bus 1 3 to an address forward unit 5 and 
on the other hand to address load and store buffer 6. 
Further, the data register file 1 contains a plurality of data 
registers 7, for example, 1 6 data registers with a bit width 
of 32 bit. A second bus 14 is provided which connects 
the data registers 7 on the one hand to data load and 
store buffers 9 and on the other hand to data forward unit 
8. The general purpose register file 1 is on the one hand 
connected to a data memory unit 1 0 through a third bus 

1 5 and on the other hand to additional buses 16,17. Bus 

16 connects the address forward buffers 5 and parts of 
the data forward buffers 8 to an unit 12 comprising, for 
example, an address arithmetic unit and an instruction 
fetch unit (IFU). Bus 17 connects the data forward buffers 
8 to an unit 1 1 comprising, for example, a multiply accu- 
mulator (MAC) and an integer execution unit. 

[0013] Figures 2A through 2F show the elements of 
the general purpose register file 1 in more detail. Only 
one bit of each register d0...d1 5, a0...a15 is shown in this 
diagram. The other 31 bits of the register d0...d15, 
a0...a1 5 are arranged in parallel. Throughout the Figures 
2A to 2F the same numerals represent the same ele- 
ments. 

[0014] Figure 2D shows registers in an address regis- 
ter file. For a better overview, registers a2 to a1 3 are not 
shown in Figure 2D. Every address register aO,... a15 of 
the address register file 4 has two uni-directional outputs 
for reading and three bidirectional l/O-ports. One internal 
bus line 40 is connected to the first bidirectional l/O-port 
of all address registers aO, ... a15, and two lines 41 and 

42 are connected to the respective two outputs of all ad- 
dress registers aO,... a15. Further, two internal bus lines 
44 and 46 are connected to respective l/O-ports of all 
even address registers aO, a2, a4 ... a14. Two more lines 

43 and 45 which are connected to the l/O-ports of all odd 
address registers a1, a3, a5...a15. 

[0015] Figure 2A shows the address load and store 
buffer 6 of Figure 1 in detail. Bus lines 43 to 46 connect 
to the respective inputs of store buffers 101 to 1 04. The 
outputsofstore buffers 101 to 104 are connected to mem- 
ory bus lines 1 19 to 122 through drivers 106 to 109. All 
drivers are tri-state drivers which are provided with a con- 
trol input for selective activation by means of a control 
unit (not shown). Two blocks 1 00 and 1 05 indicate 4 load 
buffers which are connected on the one hand to the mem- 
ory bus lines 1 1 9 to 122 and on the other hand through 
drivers 1 10 to 1 13 to the internal bus lines 43 to 46. In 



addition, the outputs of the load buffer 1 05 are connected 
through buffers 115 and 116 to the internal bus lines 43 
and 44. 

[0016] Figure 2E shows the address fonward unit 5 of 

5 Figure 1 in detail. Bus lines 40, 41 and 42 are connected 
to address forward output bus lines 524, 525 and 526 
through drivers 500, 501 and 502, respectively. Two 
memory bus lines 121 and 122 are each connected to 
the input of three drivers 503, 504, 505 and 506, 507, 

10 508. The outputs of drivers 503 and 506 connect to bus 
line 526, the outputs of drivers 504 and 507 to bus line 
525 and the outputs of drivers 505 and 508 to bus line 
524. Internal bus lines 43 and 44 are connected to the 
outputs of two drivers 518 and 519, respectively. The 

15 inputs of drivers 51 8 and 51 9 are connected to the output 
of awrite-backbuffer515the input of which is connected 
to an address forward bus input line 523. Two other driv- 
ers 516 and 517 are provided the output of which are 
coupled with the internal bus line 40. The input of driver 

20 517 is connected to the output of buffer 515. The input 
of driver 51 6 is connected to the output of buffer 514 the 
input of which is connected to address forward bus input 
line 522. Bus lines 45 and 46 are also connected to bus 
lines 522 and 523 through bidirectional drivers 520 and 

25 521 , respectively. The signal on bus line 523 is also fed 
to the inputs of three drivers 509, 51 0 and 51 1 the outputs 
of which are connected to bus lines 524, 525 and 526, 
respectively. The signal on bus line 522 is also fed to the 
inputs of two drivers 512 and 513 the outputs of which 

30 are connected to bus lines 524 and 525, respectively. 
Again, all drivers are tri-state drivers which are provided 
with a control input for selecting by means of a control 
unit (not shown). The three units 4, 5 and 6, according 
to Figure 1 , show the path of the address signals into and 

35 out of the address register file, whereby bus lines 1 1 9 to 
1 22, according to Figure 2A, connect to the memory unit 
10 and bus lines 522 to 524 to the address arithmetic 
pipe and the instruction forward unit 12. 
[0017] Figure 2C shows data registers in a data regis- 

^0 ter file. Again, for a better overview, registers d2 to d13 
are not shown in Figure 20. Every data register dO,... d1 5 
of the data register file 5 has three uni-directional outputs 
for reading and two bidirectional l/O-ports. Two lines 72 
and 73 are connected to the first two outputs of all data 

^5 registers dO,... d1 5, respectively. Further, an internal bus 
lines 71 is connected to respective outputs of all even 
data registers dO, d2, d4 ... d14, whereas an internal bus 
line 70 is connected to respective outputs of all odd reg- 
isters d1 , d3, ... d1 5. Two more lines 74 and 76 are con- 

50 nected to the l/O-ports of all odd address registers d1, 
d3, d5...d15. Finally, two other lines 75 and 77 are con- 
nected to the l/O-ports of all even address registers dO, 
d2, d4 ...d14. 

[001 8] Figure 2B shows the data load and store buffer 
55 9 of Figure 1 . Internal bus lines 74 to 77 are coupled to 
the inputs of four store buffers 909 to 912, respectively. 
The output of store buffer 909 is coupled to a memory 
bus line 921 through a driver 91 6, and the output of store 
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buffer 910 is coupled to a memory bus line 920 througli 
driver 91 7. The outputs of buffers 91 1 and 91 2 are con- 
nected to two input terminals of a mask unit 915. Mask 
unit 915 is provided with two input and two output termi- 
nals. The two output terminals of the mask unit 915 are 
coupled to two memory bus lines 922 and 923 through 
drivers 918 and 919, respectively. Two blocks 908 and 
913 depict four load buffers. Block 913 is provided with 
two input terminals which are connected to two internal 
bus lines 900 and 901 , respectively and to the memory 
bus lines 922 and 923 through an align block 914. The 
align block 91 4 is provided with two inputs which are con- 
nected to the inputs of mask unit 91 5 and the outputs of 
the store buffer 913, respectively. Furthermore, one out- 
put of load buffer 913 is coupled to internal bus lines 74 
and 76 through respective drivers 904 and 906. The other 
output of load buffer 913 is coupled to internal bus lines 
75 and 77 through drivers 905 and 907. The second load 
buffer 908 is connected on its input side to memory bus 
lines 920 and 921 , respectively and on its output side to 
internal bus lines 74 and 75 through drivers 902 and 903, 
respectively. Again, all drivers are tri-state drivers which 
are provided with a control input for selecting by means 
of a control unit (not shown). 

[0019] Figure 2F shows data forward unit 8 of Figure 

1 in detail. Internal bus lines 70 to 73 are coupled with 
the data forward bus lines 802 to 805 through drivers 807 
to 810. An additional data forward bus line 806 is con- 
nected to output terminals of drivers 827 and 828 the 
input terminals of which are connected to bus lines 802 
and 803, respectively. Bus line 806 is part of the bus 1 6, 
according to Figure 1 , which is connected to the address 
arithmetic pipe and instruction forward unit 12. Bus line 
900 is connected to the inputs of three drivers 81 1 , 812 
and 81 3 the outputs of which are connected to bus lines 
805, 804 and 802, respectively. Bus line 901 is connected 
to the inputs of three drivers 81 4, 81 5 and 81 6 the outputs 
of which are connected to bus lines 805, 804 and 803, 
respectively. Two other data forward bus lines 800 and 

801 are provided which are coupled to the input terminals 
of two write-back buffers 829 and 830, respectively. The 
output terminals of buffers 829 and 830 are coupled to 
bus lines 74 and 75 through drivers 832 and 834, respec- 
tively. In addition, the output of buffer 830 is coupled to 
bus line 74 through a driver 833. Bus line 800 is coupled 
to bus line 76 through driver 835, and bus line 801 is also 
coupled to bus lines 76 and 77 through drivers 836 and 
837, respectively. Also, bus line 806 is coupled to bus 
lines 76 and 77 through drivers 840 and 841 , respectively. 
Furthermore, bus line 806 is connected to the input ter- 
minal of a further write-back buffer 831 the outputof which 
is coupled to bus lines 76 and 77 through drivers 838 and 
839, respectively. Also, bus line 806 is coupled to bus 
lines 802 to 805 through drivers 81 7 to 820, respectively. 
Finally, bus line 800 is coupled to bus lines 805, 804 and 

802 through drivers 821 to 823, respectively and bus line 
801 is coupled to bus lines 803 to 805 through drivers 
824 to 826, respectively. Again, all drivers are tri-state 



drivers which are provided with a control input for select- 
ing by means of a control unit (not shown). The memory 
bus lines 1 19 to 122, according to Figure 2A, and 920 to 
923, according to Figure 2B, form the memory bus 15 of 
5 Figure 1 . The bus lines 522 to 526, according to Figure 
2E, and 806 form bus 16 and bus lines 800 to 805 form 
bus 17 of Figure 1. 

[0020] Figure 3 shows an embodiment of an address 
register cell. Figure 3 shows numerals which are related 

10 to address register aO according to Figure 2D. Preferably 
all registers aO .. A15 have the same structure. The reg- 
ister core is formed by two inverters 400 and 401 . These 
inverters 400 and 401 build a static memory cell, so that 
the output of each inverter is connected to the input of 
the other inverter. Input of inverter 400 and output of in- 
verter 401 are coupled to bus lines 40, 44 and 46 through 
the load path of field effect transistors 402, 403 and 406. 
Each of field effect transistors 402, 403 and 406 forms a 
transfer-gate. The output of inverter 400 and the input of 
inverter 401 is connected to the inputs of five controllable 
tri-state drivers 404, 405, 407, 408 and 409. The output 
of driver 404 is connected to bus line 40, the output of 
driver 405 to bus line 44 and the output of driver 407 to 
bus line 46. The output of driver 408 connects to bus line 
41 and the output of driver 409 to bus line 42. 
[0021] Figure 4 shows an embodiment of a data reg- 
ister cell. Again, Figure 4 shows numerals which are re- 
lated to data register dO according to Figure 2C. Prefer- 
ably, all registers dO .. d15 have the same structure. The 
register core is formed by two inverters 700 and 701. 
These inverters 700 and 701 build a static memory cell, 
so that the output of each inverter is connected to the 
input of the other inverter. Input of inverter 700 and output 
of inverter 701 are coupled to bus lines 75 and 77 through 
the load path of field effect transistors 702 and 703. The 
output of inverter 700 and the input of inverter 701 are 
connected to the inputs of five controllable tri-state driv- 
ers 704, 705, 706, 707 and 708. The output of driver 704 
is connected to bus line 75 and the output of driver 705 
to bus line 77. The output of driver 706 connects to bus 
line 71 , the output of driver 707 to bus line 72 and the 
output of driver 708 to bus line 73. 
[0022] All other registers a1 ... a15 and d1 ... d15 are 
coupled to the respective bus lines as shown in Figures 
2C and 2D. The transfer-gates 402, 403, 406, 702 and 
703 (see Figures 3 and 4) and the tri-state drivers 404, 
407, 408, 409, 704, 705, 706, 707 and 708 are control- 
lable through respective select input terminals. The con- 
trol circuit (not shown) generates the respective control 
signals to control the operation of these elements along 
with the other controllable drivers in the general purpose 
register file 1 (see Figure 1). Only one of the transfer- 
gates of a register can be activated at a time for write 
operations whereas a plurality of tri-state drivers can be 
active in parallel for any read operation. 
[0023] The general purpose registers handle all data 
which are processed by a data handling unit, such as a 
microcontroller/microcomputer. Therefore, they are part 



20 



25 



30 



35 



40 



45 



50 



4 



7 



EP 0 902 360 B1 



8 



of the microprocessor/microcontroller core, and the way 
they handle the data has an important influence on the 
resulting speed of such a data handling unit. The general 
purpose register file 1 (see Figure 1) supplies most fre- 
quently used operands to the integer and load and store 
pipelines in the units 11, 12. The load and store instruc- 
tions in the instruction set architecture provide access to 
the data memory 1 0 where a majority of operands reside 
before being brought into the register file 1 . As shown in 
Figure 1 , the functional block diagram for the register file 
is divided into address register file 2 and data register 
file 3. The data flow through the register files depends 
on the respective instruction which is being processed 
by the data handling unit. 

[0024] According to the invention, data and address 
register block 4 and 7 are arranged in two separate 
blocks. Data registers in block 7 are arranged in a first 
matrix having c1 columns and r1 rows and address reg- 
isters in block 4 are arranged in a second matrix having 
c2 columns and r2 rows. In the preferred embodiment, 
both matrixes are equal whereby 2 columns and 8 rows 
are provided. Nevertheless, in this embodiment, 4 data 
registers in block 4 and 4 address registers in block? are 
arranged in parallel, so that 8 registers, 4 even and 4 odd 
registers, can be accessed through bidirectional ports on 
the memory interface side at the same time. Therefore, 
in this example, the bus 15, which interfaces on the one 
hand with the data memory unit and on the other hand 
with the below described data processing units, needs a 
width of 256 bits. 

[0025] On the processing unit side, the address regis- 
ter file 4 interfaces with any processing unit via bus 16. 
Bus 1 6, in this embodiment provides 6 bus lines, whereby 
5 bus lines 522 .. 526 (see Figure 2E) connect to the 
address register file. The data register file 7 interfaces 
with any processing unit via bus 17. Bus 17, in this em- 
bodiment, provides 6 bus lines. Every register cell, in this 
embodiment, according to Figures 3 and 4, has 5 ports 
whereby two bidirectional ports are assigned to either 
even or odd registers. Therefore, in this embodiment, 7 
data registers and 5 address registers can be accessed 
at the same time and provide data to any processing unit 
whereby some registers can only be accessed through 
the read port. 

[0026] Depending on the operating system running the 
microcontroller, this arrangement results in access to a 
plurality of general purpose registers of the core in one 
single cycle. According to the invention, m and n are in- 
teger numbers greater than 1 such that a minimum of 4 
registers are accessible at the same time. Splitting the 
assignment of some ports into even and odd register sets 
can double this number. A context switch in a real time 
operating system may need 8 data registers and 8 ad- 
dress registers to be switched. In the preferred embodi- 
ment, this could be done in only 2 cycles, because 8 
registers are accessible at the same time. If only 4 reg- 
isters need to be changed for a context switch, this may 
be accomplished in a single cycle. 



[0027] According to the invention, each register is pro- 
vided with a plurality of uni-directional and a plurality of 
bidirectional ports. This allows more registers to be ac- 
cessed at the same time and also a wide variety of dif- 
5 ferent instructions to be executed in a minimum of time. 
This occurs in parallel as described below. These uni- 
directional and bidirectional ports can be coupled one to 
one to the bus which is connected to the data memory. 
This allows access to as many registers as ports are pro- 
^0 vided at the same time, or the registers are split into even 
and odd register sets which are coupled to even and odd 
bus lines. In the embodiment according to Figures 2A 
and 2B, the memory bus lines 119 and 121 as well as 
the memory bus lines 920 and 922 are coupled to odd 
^5 registers in the address and data register file. Bus lines 
1 20 and 1 22 as well as 921 and 923 are coupled to even 
registers, respectively. Therefore, only two ports of each 
register are used to interface with 4 registers of each 
register file 4 and 7 at the same time. On the data process- 
ing side, the internal bus can comprise additional bus 
lines which connect to unidirectional and bidirectional 
ports of the registers. In this embodiment, these bus lines 
are 802 ... 806 and 524... 526 (see Figures 2F and 2E). 
Splitting between even and odd registers is optional for 
these bus lines. Even more registers can be accessed 
at the same timeon the processing unit side, as described 
above. 

[0028] The bidirectional 256-bit bus interface on the 
data memory side of register file 1 supports the transfer 
of operands between data memory 10 and register file 
1 . This bus 1 5 could also be 1 28 bit wide or use 1 28-bits 
of 256-bit bus per cycle depending on the above de- 
scribed structure of the registers. The design of this bus 
can depend on power and noise related problems. Es- 
sentially, this bus has to support parallel access to at 
least 4 registers. 

[0029] The design according to the invention allows 
byte, half word, word and double-word transfers in a sin- 
gle cycle. The context save or restore operation allows 
for the transfer of 16 words in two cycles as mentioned 
above (four cycles on a 128-bit bus). However, the reg- 
ister file 1 is capable of saving or restoring the entire 
context (32 words) in 4 cycles (or eight cycles on 1 28-bit 
bus). In general, the register file 1 supports word, double- 
word and quadruple-double word accesses. The data 
steering for byte, half-word and unaligned accesses is 
handled in the address load and store buffers 6, the data 
load and store buffers 9 and/or in the DMU module 10. 
[0030] The general purpose register file 1 of this em- 
bodiment contains thirty two 32-bit registers which, ac- 
cording to the principle of the present invention, are 
equally split to facilitate parallel operation of integer, and 
load and store instructions as described above. The data 
portion of registers, which supply data operands to the 
integer pipeline, reside in the data register file block 7 
(DGPR). The DGPR 7 contains sixteen 32-bit data reg- 
isters. The address portion of registers which supply ad- 
dress operands to the load and store pipeline reside in 
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the address register file block 4 (AGPR). The AGPR 4 
contains sixteen 32-bit address registers. The instruction 
normally provides the register addresses for read and 
write accesses. The read and write address pointers are 
derived from the instruction in the instruction fetch stage 
and staged into the decode stage. The address in the 
read pointers is decoded to read the register operands 
in the decode stage. The address in write pointers is 
staged to the write-back stage. The decision for operand 
fonwarding is made in the decode stage to minimize crit- 
ical paths in the instruction fetch stage. 
[0031] Context instructions, with implied address se- 
quence, access a group of registers at a time in each 
register file 2, 3. For example, during context transfer, 
address registers aO, a1 , a4, and a5 are accessed to- 
gether during the first cycle, and address registers a2, 
a3, a6 and a7 are accessed together during the next cy- 
cle. 

[0032] In this embodiment, the MAC instruction dic- 
tates the number of read and write ports in the data reg- 
ister file 7 which interface with the integer pipeline in unit 
1 1 . Also, the context access dictates the requirement of 
number of bidirectional l/O-Ports in the data register file 
7 which interface with the DMU 10. The structure of a 
single register has been explained in accordance with 
Figures 3 and 4. Due to the need for execution of integer 
and load and store instructions in parallel, ports are ded- 
icated in the data register file 7 to interface exclusively 
with the integer pipeline of unit 1 1 and DMU 1 0. The read 
and write ports in DGPR 7 allow parallel operation of 
multiply- and load-instructions or of multiply- and store- 
instructions. The multiply-instruction in this embodiment, 
for example, requires four 32-bit read ports represented 
by bus lines 802 to 805 (see Figure 2F) and two 32-bit 
write ports represented by bus lines 800 and 801. The 
context access needs four 32-bit bidirectional ports rep- 
resented by bus lines 920 to 923 (see Figure 2B) and 
internal buses 900 and 901 which are separate from the 
read ports dedicated for integer pipeline in unit 1 1 . The 
load- and store-instructions could share two out of four 
context ports and internal buses 74 to 77 (see Figure 2B) 
which could be referred to as load and store ports and 
buses. The remaining two internal context buses could 
be shared by the integer pipeline for write-back operation 
which could be referred to as write-back buses. 
[0033] Many address arithmetic instructions require 
two 32-bit read ports and one 32-bit write port in the ad- 
dress register file 4 to interface with the address arith- 
metic pipeline in unit 1 2. In this embodiment, the context 
access dictates the number of bidirectional l/O-ports in 
the address register file 4 which interface with the DMU 
10. In the preferred embodiment, the context access 
needs four 32-bit bidirectional ports represented by bus 
lines 1 1 9 to 1 22 (see Figure 2A) and internal buses which 
are separate from the read ports dedicated to the address 
arithmetic pipeline. In other words, one read/write port 
for each register which can be accessed in parallel must 
be provided. The number of ports therefore depends on 



the wide of the bus, or on the number of words which can 
be transferred in parallel on this bus. The load- and store- 
instructions could share two out of four context ports and 
two out of four internal buses which could be referred to 

5 as load and store ports and buses. The remaining two 
internal context buses could be shared by the address 
arithmetic pipeline for write-back operation which could 
be referred to as write-back buses. A dedicated 32-bit 
bidirectional port represented by bus lines 800 (see Fig- 

10 ure 2F) handles transfers between the instruction fetch 
unit and the special function registers of unit 12 and ad- 
dress register file 4. Context and stack pointers share 
the write back bus to interface with general purpose file 1 . 
[0034] Due to the limitation on the number of read and 

^5 write ports available in the register file 1 , instructions in- 
volving both data and address registers share existing 
ports only. As a result, in this embodiment, many address 
arithmetic operations when issued in parallel with integer 
instructions may need to be canceled in the current cycle 

20 and are-injected in the following cycle. A dedicated 32- 
bit bidirectional port 806 (see Figure 2F) in the register 
file solves the data transfer problem between data and 
address register files. The data operand from a read port 
in the data register file 7 is steered to a read port in the 

25 address register file 4 through the bidirectional port. The 
result from the address arithmetic pipeline is steered to 
the load and store buses in the data register file 7 through 
the write port in the address register file 4 and the bidi- 
rectional port for write-back operation. 

30 [0035] A read-modify-write operation is performed to 
access the link word from the data memory during context 
save or restore operation. The context save operation 
uses a pointer to access the link word. During context 
save operation, (a) the link word is loaded into the pointer 

35 register, (b) the contents from the pointer register are 
stored into a link word address location (as pointed to by 
the old pointer contents), and (c) the pointer register as- 
sumes the old pointer contents. The context restore op- 
eration uses the pointer register to access the link word. 

"^0 During context restore operation, (a) the link word is load- 
ed into the pointer register, (b) the contents from the 
pointer register are stored into a link word address loca- 
tion (as pointed to by the old pointer register contents), 
and (c) the pointer register assumes the old pointer con- 

45 tents. 

[0036] The register file 1 interfaces with the DMU 10 

on one side, and on the other side directly connects to 
the address arithmetic pipeline IFU 12 and to MAC and 
integer pipelines 1 1 . The controls for accessing the reg- 

50 ister file are derived from the instructions. To provide a 
better overview, these lines are not shown in Figures 2A 
to 2F. Address registers 4 are read in the first phase of 
the decode (or execute for store operand) cycle and writ- 
ten in the first phase of the write-back cycle to avoid hav- 

55 ing to forward a write-back result. For example, when the 
next+1 instruction needs the result from the current in- 
struction (next being in the middle), the operand will be 
read directly from the address registers 4. Data registers 
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7 are read in the second phase of the decode (or execute 
for store operands) cycle and written in the second phase 
of the write-back cycle. Accessing address and data reg- 
isters in alternate phases reduces peak power. The data 
register file 3 consists of the following blocks: data gen- 
eral purpose registers 7, load and store buffers 9 for data 
and, data forwarding buffers 8. The address register file 
2 consists of the following blocks: address general pur- 
pose registers 4, load and store buffers 6 for address 
and, address forwarding unit 5. 

[0037] In the preferred embodiment, the data register 
file blocks contains 1 6 data registers dO, .. d1 5. The basic 
data register cell in the data register file 7 has three read 
ports 71, 72, 73 (see Figure 2F) and two bidirectional 
ports 75 and 77. Two read ports 72 and 73 connect to all 
sixteen registers dO, ... d1 5 which enable access to any 
of these registers dO, ... d15. However, the read ports 
70, 71 are connected either to even or odd registers, so 
that even registers dO, d2,... d14 are accessed through 
even port 71 and odd registers d1, d3, ... d15 are ac- 
cessed through odd port 70. The IVIAC and other instruc- 
tions need a third read operand. The MAC instruction 
requires a 64-bit accumulate operand which needs no 
further steering because it is already aligned to the even 
address boundary. However, if the third operand is a 32- 
bit data as in some special instructions, depending on 
the register address, the operand could show up on the 
even or odd read port 71 , 70 which needs further steering 
in the integer pipeline. The operand steering for the spe- 
cial instruction could be handled in the integer pipeline 
in its own input multiplexers which avoids the need of 
sending the controls to the register file 1 . This eliminates 
one more read port in the data register file cell. 
[0038] Each bidirectional port 75, 77, 74, 76 in the reg- 
ister cell is connected either to even or odd registers, so 
that even registers dO, d2, ... d14 are accessed through 
even port 75, 77 and odd registers d1, d3, ... d15 are 
accessed through odd port 74, 76. This eliminates extra 
bidirectional ports in the data register cell. However, the 
data result needs to be aligned or steered to appropriate 
buses from outside either in data memory unit 1 0 for load 
and store instructions or in datafonward unit 8 for integer 
instructions. 

[0039] When integer, load and store instructions are 
issued in sequence, the data load and store buses 76 
and 77 and the data write back buses 74 and 75 inside 
the data register file 3 are occupied respectively by inte- 
ger and store instructions. The load data and destination 
register pointer stay in the load buffers until any of the 
buses 74 to 77 is made available. Meanwhile, the load 
buffers supply the data whenever the read pointer and 
destination pointer in the load buffers match. 
[0040] For an integer followed by a load instruction se- 
quence, the load data occupies the data load and store 
bus lines 76, 77 while the write back result of integer 
instruction occupies the data write back bus lines 74, 75. 
However, for load followed by store instruction sequence, 
the load data instead of using data load and store bus 



lines 76, 77, occupies the data write back bus lines 74, 
75 while the store data occupies the data load and store 
bus lines 76, 77. 

[0041] The address register file block 2 contains 16 

5 address registers. The basic register cell in the register 
has two read ports 41, 42 (see Figure 2E) and three bi- 
directional ports 40, 43, 44 or 40, 45, 46. However, both 
read ports 41 , 42 and one bidirectional port 40 connect 
to all sixteen registers aO, ... a15. Address arithmetic in- 

10 structions need only two read ports and one write port 
because they share the write-back buses. The load and 
store instructions use the load and store ports. The third 
bidirectional port is dedicated to loop- and jump-instruc- 
tions. As usual, the address result needs to be aligned 

15 or steered outside either in data memory unit 1 0 for load 
and store instructions or in the address forward unit 5 for 
address arithmetic instructions to be able to access even 
and odd bidirectional ports. 

[0042] Data forward unit 8 forwards the load or integer 
20 results in the execute stage of the current instruction to 
the read ports in the decode stage of next instruction to 
minimize data dependent latency. Address forward unit 
5 forwards the load or address arithmetic results in the 
execute stage of current instruction to the read ports in 
25 the decode stage of next instruction to minimize address 
dependent latency. The register contents are still read in 
the decode stage for next instruction even if the current 
instruction holds most recent data. The contents from the 
register are discarded if the current instruction has a most 
30 recent data result. If forward detection can be made for 
next instruction in the instruction fetch stage, register 
reading could be avoided for those operands whose ad- 
dress or data needs to be obtained from the results of 
current instruction in the execute stage hence saving 
35 power in the register file 1 . 

[0043] The integer followed by a load and store or an 
address arithmetic instruction could be issued in parallel. 
However, in this embodiment, if the result of the integer 
instruction needs to be forwarded to the address arith- 
metic pipeline, the instruction in the address arithmetic 
pipeline stalls for one cycle. The result fonwarding from 
the address arithmetic pipeline to the integer pipeline 
does not need to stall the integer pipeline because the 
address arithmetic, load or store followed by integer in- 
^5 structions are never issued in parallel. A 32-bit bidirec- 
tional bus provides data transfers between data forward- 
ing unit 8 and address forwarding unit 5. A special for- 
warding path allows execution of an integer followed by 
store instructions in parallel. The result from the integer 
50 pipeline in the execute stage is directly forwarded to the 
store buffers. 

[0044] The write back operation for integer instructions 
goes through write-back buffers 829, 830, 831 (see Fig- 
ure 2F) in data forwarding unit 8. The write back operation 
55 for address arithmetic, loop- and some jump- instructions 
go through write-back buffer 515 (see Figure 2E) in ad- 
dress forwarding unit 5. 

[0045] The load and store buffer blocks 6 and 9 are 
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functionally identical, except for the mask block 91 5 (see 
Figure 2B) in data load and store buffer 9. Data load and 
store buffer 9 and address load and store buffer 6 inter- 
face with the DIVIU 1 0 for context, load and store opera- 
tions. The data load and store buffer 9 provides an inter- 
face between data register file 7 and the DMU 10. The 
address load and store buffer 6 provides an interface 
between address register file 2 and the DMU 10. Even 
though the data path diagram of Figures 2A to 2F shows 
the configuration for bit 0, the same configuration applies 
to all bits in the diagram. 

[0046] Simple load and context load instructions use 

the load buffers 100, 105 (see Figure 2A), 908 and 913 
(LDB) for write back operation. These load buffers 100, 
1 05, 908 and 913 are separate from the write-back buff- 
ers in both data fonward unit 5 and address forward unit 
8. Context load instruction uses all four load buffers 1 00, 
105, 908 and 913 for staging two double words during 
context write back operation. The different load instruc- 
tions for double words use two load buffers for staging a 
double word during write back operation. The other load 
instructions and similar instructions use an even or odd 
load buffer of buffers 1 00, 1 05, 908 and 913 for staging 
a word for write back operation. 

[0047] The ALIGN sub-block 91 4 in data load and store 

buffer 9 handles zero and sign extension for byte and 
half-word load instructions. However, DMU 10 handles 
unpacking load data and data steering for unaligned, 
byte, half-word, word and double-word load operations 
in the load and store switch. 

[0048] Simple store and context store instructions use 
the store buffers (STB) 101 ... 104; 909 ... 912 for store 
operation. Context store instruction uses all four store 
buffers 101 ... 104; 909 ... 912 for staging two double 
words during context store operation. The double word 
store instructions use two store buffers for staging a dou- 
ble word during store operation. The other store instruc- 
tions and similar instructions use even or odd store buffer 
1 02, 1 04, 91 0, 91 2; 1 01 , 1 03, 909, 91 1 for staging a word 
for store operation. However, DMU 10 handles packing 
store data and data steering for unaligned, byte, half- 
word, word and double-word store operations in the load 
and store switch. 

[0049] The swap instruction for words requires register 
that contents be swapped with data memory contents 
which could be viewed as a single cycle read-modify- 
write operation. The load and store buffers 6 and 9 handle 
the swap operation. The swap instruction for words per- 
forms a load instruction for words and a store instruction 
for words in parallel. The load data from data memory 
gets into the load buffers 908 and 913 and store data 
from register file gets into the store buffers 909 .. 912 in 
execute stage. The store data goes out to the data mem- 
ory 1 0 and the load data gets into the data register file 7 
in the write back stage. The peripheral interface (not 
shown) and internal memory treat swap instructions as 
two separate operations. 

[0050] For a special load and mask instruction a single 



cycle read-modify-write operation is required, and this 
instruction performs operations similar to the above men- 
tioned swap instruction with a few minor changes. The 
load and mask instruction uses mask unit 91 5 to perform 

5 a mask operation. The load operation reads the data 
memory and updates the load buffers 908 and 913 but 
does not update registers dO ... d1 5. The store operation 
reads a double word from the registers dO ... d15 and 
stores only a single word into the data memory 10. The 

^0 data from data memory 1 0 gets into the load buffers 908 
and 913 and store data from register file 7 gets into the 
store buffers in the execute stage. The mask unit 915 
uses upper word from the store buffers 91 1 and 912 to 
merge the lower word with the output from the load buff- 

15 ers 908 and 913. The store data goes out to the data 
memory 10 in the write back stage. Between the data 
memory unit 10 and the general purpose register file, a 
cache memory can be optionally provided to improve ac- 
cess to the data stored in the memory. 

20 

Claims 

1 . A data processing unit (1 ) comprising 
25 a set (3) of data registers (dO, d1 , d1 5) and a set 

of address registers (aO, a1 , a15), each register 
having a width of n bits, and each set of data and 
address registers comprising even and odd data and 
address registers, respectively, 
30 first address load (100, 105) and store (102, 104) 
buffers associated with said even (aO, a1 , a14) 
address registers and second address load (100, 
105) and store (101, 103) buffers associated with 
said odd (a1, a3, a15) address registers, 
35 first data load (908, 913) and store (91 0, 912) buffers 
associated with said even (dO, d2, d14) data reg- 
isters and second data load (908, 913) and store 
(909, 911) buffers associated with said odd (d1, 
d3, d15) data registers, 

a bus having a plurality of bus lines (43 - 46, 74 - 77) 
comprising a width of 4n bits being connected to said 
first and second load and store buffers (100, 105, 
908, 913, 101 - 104, 909-912), 
adata memory unit (1 0) being connected to said bus, 
^5 wherein 2n-bit data words and 2n-bit address words 
can be transferred from or to said memory (1 0) to or 
from one of said even (dO, d2, d14), one of said 
odd (d1, d3, d15) data registers, one of said even 
(aO, a2, a14), and one of said odd (a1, a3, 
50 a15) address registers in parallel. 

2. Data processing unit (1 ) as in claim 1 , wherein each 
register has a plurality of ports and at least one of 
said bus lines (43 - 46, 74 - 77) is coupled in parallel 

55 to one of said ports of each register. 

3. Data processing unit (1 ) as in claim 2, wherein each 
register comprises a memory cell (400, 401, 700, 
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701 ) having an input terminal and an output terminal, 
a plurality of tri-state buffers (404, 405, 407, 408, 
409, 704 - 708) with inputs and outputs, said output 
terminal being connected to the inputs of said plu- 
rality of tri-state buffers (404, 405, 407, 408, 409, 
704 - 708). 

4. Data processing unit (1 ) as in claim 3, wherein each 
register further comprises a plu rality of transfer gates 
(402, 403, 406, 702, 703), said input terminal of said 
memory cell (400, 401, 700, 701) being connected 
to the outputs of said plurality of transfer gates (402, 
403, 406, 702, 703). 

5. Data processing unit (1) as in claim 1, further com- 
prising an address forward unit (5) associated with 
said address registers (aO, a1,..., a15) and a 
processing unit (12) connected to said address for- 
ward unit (5). 

6. Data processing unit as in claim 5, wherein each 
address register (aO, a1, a15) has a plurality of 
ports, the address forward unit (5) has a plurality of 
terminals, at least one of said terminals is coupled 
to one of said ports of at least one of said address 
registers (aO, a1, a15). 

7. Data processing unit (1 ) as in claim 6, wherein each 
register (aO, a1, a15) comprises a memory cell 
(400, 401) having an input terminal and an output 
terminal, aplurality of tri-state buffers (404, 405,407, 
408, 409) with inputs and outputs, said output termi- 
nal being connected to the inputs of said plurality of 
tri-state buffers (404, 405, 407, 408, 409). 

8. Data processing unit (1 ) as in claim 7, wherein each 
register (aO, a1 , a1 5) further comprises a plurality 
of transfer gates (402, 403, 406), said input terminal 
of said memory cell (400, 401) being connected to 
the outputs of said plurality of transfer gates (402, 
403, 406). 

9. Data processing unit (1 ) as in claim 1 , further com- 
prising a data forward buffer (8) associated with said 
data registers (dO, d1 , d15) and a processing unit 
(11) connected to said data forward buffer (B). 

10. Data processing unit (1 ) as in claim 9, wherein each 
data register (dO, d1 , d15) has a plurality of ports, 
the data forward unit has a plurality of terminals, at 
least one of said terminals is coupled to one of said 
portsof at least one of said data registers (dO, d1, 
d15). 

11. Data processing unit (1) as in claim 1 0, wherein each 
register (dO, d1, d15) comprises a memory cell 
(700, 701) having an input terminal and an output 
terminal, a plurality of tri-state buffers (704 - 708) 



with inputs and outputs, said output terminal being 
connected to the inputs of said plurality of tri-state 
buffers (704 - 708). 

5 12. Data processing unit (1 ) as in claim 1 1 , wherein each 
register (dO, d1 , d1 5) further comprises aplurality 
of transfer gates (702, 703), said input terminal of 
said memory cell (700, 701) being connected to the 
outputs of said plurality of transfer gates (702, 703). 

10 

13. Data processing unit (1) as in claim 1, wherein an 
align unit (914) is provided which connects even or 
odd registers to even or odd addresses of the mem- 
ory through said bus. 

15 

14. Data processing unit (1) as in claim 1, wherein a 
mask unit (91 5) is provided which is connected to at 
least one port of at least one of said registers. 



20 

Patentanspriiche 

1 . Datenverarbeitungseinheit (1 ), die enthalt: 

25 einen Satz (3) von Datenregistern (dO, d1, 

d1 5) und einen Satz von Adressenregistern (aO, 
a1, a15), wobei jedes Register eine Breite 
von n Bit hat und wobei jeder Satz von Daten- 
bzw. Adressenregistern geradzahlige bzw. un- 

30 geradzahlige Daten- und Adressen register ent- 

halt, 

erste Adressenlade- (100, 105) und Adressen- 
speicherpuffer (102, 104), die den geradzahli- 
gen Adressenregistern (aO, a1, a14) zuge- 

35 ordnet sind, und zweite Adressenlade- (100, 

105) und Adressenspeicherpuffer (101, 103), 
die den ungeradzahligen Adressenregistern 
(a1, a3, a15) zugeordnet sind, 
erste Datenlade- (908, 91 3) und Datenspeicher- 
puffer (910, 912), die den geradzahligen Daten- 
registern (do, d1 , d14) zugeordnet sind, und 
zweite Datenlade- (908, 913) und Datenspei- 
cherpuffer (909, 91 1), die den ungeradzahligen 
Datenregistern (d1, d3, d15) zugeordnet 

45 sind, 

einen Bus, der mehrere Busleitungen (43-46, 
74-77) besitzt, eine Breite von 4n Bit aufweist 
und mitden ersten bzw. zweiten Lade- und Spei- 
cherpuffern (100, 105, 908, 913, 101-104, 

50 909-912) verbunden ist, 

eine Datenspeichereinheit (1 0), die mit dem Bus 
verbunden ist, wobei 2n-Bit-Datenw6rter und 
2n-B it-Ad ressworter von oder zu dem Speicher 
(10) zu Oder von einem der geradzahligen Da- 

55 tenregister(dO, d2, d1 4), einem der ungerad- 

zahligen Datenregister (d1, d3, d15), einem 
der geradzahligen Adressenregister (aO, a2, 
a1 4) und einem der ungeradzahligen Adressen- 
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register (a1, a3, a15) parallel ubertragen 
warden konnen. 

2. Datenverarbeitungseinheit (1 ) nach Anspruch 1 , wo- 

bei jedes Register mehrere Ports besitzt und wenig- 5 
stens eine der Busleitungen (43-46, 74-77) mit ei- 
nem der Ports jedes Registers parallelgeschaltet ist. 

3. Datenverarbeitungseinheit (1 ) nach Anspruch 2, wo- 

bei jedes Register eine Speicherzelle (400, 401 , 700, io 
701), die einen Eingangsanschluss und einen Aus- 
gangsanschluss besitzt, und mehrere Tristate-Puf- 
fer (404, 405, 407, 408, 409, 704-708) mit Eingangen 
und Ausgangen enthalt, wobei der Ausgangsan- 
schluss mit den Eingangen der mehreren Tristate- ^5 
Puffer (404, 405, 407, 408, 409, 704-708) verbunden 
ist. 

4. Datenverarbeitungseinheit (1 ) nach Anspruch 3, wo- 
bei jedes Register ferner mehrere Ubertragungsgat- 20 
ter (402, 403, 406, 702, 703) aufweist, wobei der 
Eingangsanschluss der Speicherzelle (400, 401, 
700, 701) mit den Ausgangen der mehreren Uber- 
tragungsgatter (402, 403, 406, 702, 703) verbunden 

ist. 25 

5. Datenverarbeitungseinheit (1 ) nach Anspruch 1 , die 
ferner eine Adressenweiterleitungseinheit (5), die 
den Adressenregistern (aO, a1, a15) zugeordnet 

ist, und eine Verarbeitungseinheit (12), die mit der 30 
Adressenweiterleitungseinheit (5) verbunden ist, 
enthalt. 

6. Datenverarbeitungseinheit nach Anspruch 5, wobei 
jedes Adressenregister (aO, a1, a15) mehrere 35 
Ports besitzt, die Adressenweiterleitungseinheit (5) 
mehrere Anschlusse besitzt und wenigstens einer 
der Anschlusse mit einem der Ports von wenigstens 
einem der Adressenregister (aO, a1 , a1 5) verbun- 
den ist. 40 

7. Datenverarbeitungseinheit (1 ) nach Anspruch 6, wo- 
bei jedes Register (aO, a1, a15) eine Speicher- 
zelle (400, 401) mit einem Eingangsanschluss und 
einem Ausgangsanschluss und mehrere Tristate- ^5 
Puffer (404, 405, 407, 408, 409) mit Eingangen und 
Ausgangen enthalt, wobei der Ausgangsanschluss 

mit den Eingangen der mehreren Tristate- Puffer 
(404, 405, 407, 408, 409) verbunden ist. 

50 

8. Datenverarbeitungseinheit (1 ) nach Anspruch 7, wo- 
bei jedes Register (aO, a1, a15) ferner mehrere 
Ubertragungsgatter (402, 403, 406) aufweist, wobei 
der Eingangsanschluss der Speicherzelle (400, 401 ) 

mit den Ausgangen der mehreren Ubertragungsgat- 55 
ter (402, 403, 406) verbunden ist. 

9. Datenverarbeitungseinheit (1 ) nach Anspruch 1 , die 
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ferner einen Datenweiterleitungspuffer (8), der den 
Daten register n (dO, d1 , d15) zugeordnet ist, und 
eine Verarbeitungseinheit (11), die mit dem Daten- 
weiterleitungspuffer (8) verbunden ist, enthalt. 

10. Datenverarbeitungseinheit (1) nach Anspruch 9, wo- 
bei jedes Datenregister (dO, d1, d15) mehrere 
Ports besitzt, die Datenweiterleitungseinheit mehre- 
re Anschlusse besitzt und wenigstens einer der An- 
schlusse mit einem der Ports von wenigstens einem 
der Datenregister (dO, d1 , d15) verbunden ist. 

11. Datenverarbeitungseinheit (1) nach Anspruch 10, 
wobei jedes Register (dO, d1 , d15) eine Speicher- 
zelle (700, 701) mit einem Eingangsanschluss und 
einem Ausgangsanschluss und mehrere Tristate- 
Puffer (704, 708) mit Eingangen und Ausgangen ent- 
halt, wobei der Ausgangsanschluss mit den Eingan- 
gen der mehreren Tristate- Puffer (704-708) verbun- 
den ist. 

12. Datenverarbeitungseinheit (1) nach Anspruch 11, 
wobeijedes Register (dO, d1 , d1 5)ferner mehrere 
Ubertragungsgatter (702, 703) enthalt, wobei der 
Eingangsanschluss der Speicherzelle (700, 701 ) mit 
den Ausgangen der mehreren Ubertragungsgatter 
(702, 703) verbunden ist. 

1 3. Datenverarbeitungseinheit (1 ) nach Anspruch 1 , wo- 
bei eine Synchronisationseinheit (914) vorgesehen 
ist, die geradzahlige oder ungeradzahlige Register 
mit geradzahligen oder ungeradzahligen Adressen 
des Speichers uber den Bus verbindet. 

14. Datenverarbeitungseinheit (1 ) nach Anspruch 1 , wo- 
bei eine Maskierungseinheit (915) vorgesehen ist, 
die mit wenigstens einem Port wenigstens eines der 
Register verbunden ist. 



Revendications 

1 . Unite de traitement de donnees (1 ) comprenant 

- un ensemble (3) de registres de donnees (dO, 
d1 , ...,d15)etun ensemble de registres d'adres- 
ses (aO, a1 , a15), chaque registre ayant une 
largeur de n bits, et chaque ensemble de regis- 
tres de donnees et d'adresses comprenant des 
registres de donnees et d'adresses paires et im- 
paires et, respectivement, 

- des premiers tampons de chargement (100, 
105) et de stockage (102, 104) d'adresses as- 
socies aux registres d'adresses paires (aO, 
a1 , a4) et des deuxiemes tampons de char- 
gement (100, 105) et de stockage (101, 103) 
d'adresses associes aux registres d'adresses 
impaires (a1, a3, ... , a15). 
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- des premiers tampons de chargement (908, 
913) et de stockage (910, 912) de donnees as- 
socies aux reglstres de donnees paires (dO, 
d2, ... , d1 4) et des deuxiemes tampons de char- 
gement (908, 913) etde stockage (909, 911) de 5 
donnees associes aux registres de donnees im- 
paires (d1, d3, d15), 

- un bus possedant une pluralite de lignes de 
bus (43-46, 74-77) comprenant une largueur de 
4n bits, et etant connecte aux premiers et 
deuxiemes tampons de chargement et de stoc- 
kage (1 00, 1 05, 908, 91 3, 1 01 -1 04, 909-91 2), et 

une unite de memoire de donnees (10) etant con- 
nectee au bus, ^5 
dans laquelle des mots de donnees de 2n bits etdes 
mots d'adresses de 2n bits peuvent etre transferes 
depuis ou vers la memoire (10) vers ou depuis I'un 
des registres de donnees paires (dO, d2, d14), 
I'un des registres de donnees impaires (d1, d3, 20 
d15) , I'un des registres d'adresses paires (aO, 
a2, ... , a14), et I'un des registres d'adresses impai- 
res (a1, a3, a15) en parallele. 



2. Unite de traitement de donnees (1) selon la reven- 25 
dication 1, dans laquelle chaque registre possede 
une pluralite de ports et au moins I'une des lignes 

de bus (43-46, 74-77) est couplee en parallele a I'un 
des ports de chaque registre. 

30 

3. Unite de traitement de donnees (1) selon la reven- 
dication 2, dans laquelle chaque registre comprend 
une cellule de memoire (400, 401, 700, 701) ayant 
une borne d'entree et une borne de sortie, une plu- 
ralite de tampons a trois etats (404, 405, 407, 408, 35 
405, 704-708) avec des entrees et des sorties, la 
borne de sortie etant connectee aux entrees de la 
pluralite de tampons a trois etats (404, 405, 407, 
408, 409, 704-708). 

40 

4. Unite de traitement de donnees (1) selon la reven- 
dication 3, dans laquelle chaque registre comprend 
en outre une pluralite de passerelles de transfert 
(402, 403, 406, 702, 703), la borne d'entree de la 
cellule de memoire (400, 401, 700, 701) etant con- ^5 
nectee aux sorties de la pluralite de passerelles de 
transfert (402, 403, 406, 702, 703). 

5. Unite de traitement de donnees (1) selon la reven- 
dication 1 , comprenant en outre une unite d'achemi- 50 
nement d'adresse (5) associee aux registres 
d'adresses (aO, a1, a15) et une unite de traite- 
ment (12) connectee a la unite d'acheminement 
d'adresse (5) . 

55 

6. Unite de traitement de donnees selon la revendica- 
tion 5, dans laquelle chaque registre d'adresses (aO, 
a1, a15) possede une pluralite de ports, I'unite 



d'acheminementd'adresse (5) possede une pluralite 
de bornes, au moins I'une des bornes est couplee a 
I'un des ports d'au moins I'un des registres d'adres- 
ses (aO, a1 , a15). 

7. Unite de traitement de donnees (1) selon la reven- 
dication 6, dans laquelle chaque registre (aO, a1 , 
a15) comprend une cellule de memoire (400, 401) 
ayant une borne d'entree et une borne de sortie, un 
pluralite de tampons a trois etats (404, 405, 407, 
408, 409) avec des entrees et des sorties, la borne 
de sortie etant connectee aux entrees de la pluralite 
de tampons a trois etats (404, 405, 407, 408, 409). 

8. Unite de traitement de donnees (1) selon la reven- 
dication 7, dans laquelle chaque registre (aO, a1 , 
a15) comprend en outre une pluralite de passerelles 
de transfert (402, 403, 406), la borne d'entree de la 
cellule de memoire (400, 401) etant connectee aux 
sorties de la pluralite de passerelles de transfert 
(402, 403, 406). 

9. Unite de traitement de donnees (1) selon la reven- 
dication 1 , comprenant en outre un tampon d'ache- 
minement de donnees (8) associe aux registres de 
donnees (dO, d1 , d1 5) et une unite de traitement 
(1 1 ) connectee au tampon d'acheminement de don- 
nees (B) . 

10. Unite de traitement de donnees (1) selon la reven- 
dication 9, dans laquelle chaque registre de donnees 
(do, d1 , d1 5) possede une pluralite de ports, I'uni- 
te d'acheminement de donnees possede une plura- 
lite de bornes, au moins I'une des bornes est couplee 
a I'un des ports d'au moins I'un des registres de don- 
nees (do, d1, .... d15). 

11. Unite de traitement de donnees (1) selon la reven- 

dication 1 0, dans laquelle chaque registre (dO, d1 , 
d15) comprend une cellule de memoire (700, 701) 
ayant une borne d'entree et une borne de sortie, une 
pluralite de tampons a trois etat (704-708) avec des 
entrees et des sorties, la borne de sortie etant con- 
nectee aux entrees de la pluralite de tampons a trois 
etats (704-708). 

12. Unite de traitement de donnees (1) selon la reven- 
dication 1 1, dans laquelle chaque registre (d0,d1, 
d15) comprend en outre une pluralite de passerelles 
de transfert (702, 703), la borne d'entree de la cellule 
de memoire (700, 701) etant connectee aux sorties 
de la pluralite de passerelles de transfert (702, 703). 

13. Unite de traitement de donnees (1) selon la reven- 
dication 1, dans laquelle une unite d'alignement 
(914) connecte les registres pairs ou impairs a des 
adresses paires ou impaires de la memoire par le 
biais, du bus. 
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Unite de traitement de donnees (1) selon la reven- 
dication 1 , dans laquelle une unite de masquage 
(915) est connectee a au moins un port d'au moins 
Tun des registres. 
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